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1 Lon g term file mi g ration: development and evaluation of al g orithms 
Alan Jay Smith 



August 1981 Communications of the ACM, volume 24 issue 8 
Publisher: ACM Press 

Additional Information: full citation, abstract , references, citings, index 
t e rms 



Full text available: ffl pdfd.26 MB) 



The steady increase in the power and complexity of modern computer systems has 
encouraged the implementation of automatic file migration systems which move files 
dynamically between mass storage devices and disk in response to user reference 
patterns. Using information describing 13 months of user disk data set file references, we 
develop and evaluate (replacement) algorithms for the selection of files to be moved from 
disk to mass storage. Our approach is general and demonstrates a general ... 

Keywords: file migration, mass storage, memory hierarchies, paging, replacement 
algorithm . 



2 Fast detection of communication patterns in distributed executions 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced 
Studies on Collaborative research CASCON '97 

Publisher: IBM Press 

Full text available: ^[ pdf(4.21 MB ) Additional Information: full c itation , abstract, references, index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the 
execution of the application. The visualization tool we use is Poet, an event tracer 
developed at the University of Waterloo. However, these diagrams are often very complex 
and do not provide the user with the desired overview of the application. In our 
experience, such tools display repeated occurrences of non-trivial commun ... 

3 Data base directions: the next ste ps 
John L. Berg 

November 1976 ACM SIGMOD Record , ACM SIGMIS Database, volume 8 , 8 issue 4 , 2 
Publisher: ACM Press 

Full text available: ^[ pdf ( 9.95 MB) Additional Information: full citation , abstract , citing s 

What information about data base technology does a manager need to make prudent 
decisions about using this new technology? To provide this information the National 
Bureau of Standards and the Association for Computing Machinery established a workshop 
of approximately 80 experts in five major subject areas. The five subject areas were 
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auditing, evolving technology, government regulations, standards, and user experience. 
Each area prepared a report contained in these proceedings. The proceedings p ... 

Keywords: DBMS, auditing, cost/benefit analysis, data base, data base management, 
government regulation, management objectives, privacy, security, standards, technology 
assessment, user experience 



4 Stora g e systems: Position: short ob j ect lifetimes require a delete-optimized stora ge 
^ s ystem 

^ Fred Douglis, John Palmer, Elizabeth S. Richards, David Tao, William H. Tetzlaff, John M. 
Tracey, Jian Yin 

September 2004 Proceedings of the 11th workshop on ACM SIGOPS European 
workshop: beyond the PC EW11 

Publisher: ACM Press 

Full text available: pdfd 46.32 KB) Additional Information: full citation , abstract , references , citings 

Early file systems were designed with the expectation that data would typically be read 
from disk many times before being deleted; on-disk structures were therefore optimized 
for reading. As main memory sizes increased, more read requests could be satisfied from 
data cached in memory, motivating file system designs that optimize write performance. 
Here, we describe how one might build a storage system that optimizes not only reading 
and writing, but creation and deletion as well. Efficiency is a ... 

5 National id card: the next g eneration: The US/Mexico border crossin g card (BCC): a 
^ case study in biometric. machine-readable id 

^ Andrew Schulman 

April 2002 Proceedings of the 12th annual conference on Computers, freedom and 
privacy CFP '02 

Publisher: ACM Press 

Full text available: g) htm( 18 7.31 KB) Additional Information: full citation, index t e rms 



6 Short papers -- works in prog ress: Toward a threat model for stora g e systems 
Ragib Hasan, Suvda Myagmar, Adam J. Lee, William Yurcik 

November 2005 Proceedings of the 2005 ACM workshop on Storage security and 
survivability StorageSS '05 

Publisher: ACM Press 

Full text available: ^pdf(258.24 KB) Additional Information: full citation , abstrac t, references , index terms 

The growing number of storage security breaches as well as the need to adhere to 
government regulations is driving the need for greater storage protection. However, there 
is the lack of a comprehensive process to designing storage protection solutions. 
Designing protection for storage systems is best done by utilizing proactive system 
engineering rather than reacting with ad hoc countermeasures to the latest attack du 
jour. The purpose of threat modeling is to organize system threats and vulnera ... 

Keywords: security, storage system, threat model 




7 Level II technical su p por t in a di st ribute d com putin g environment 
Tim Leehane 

September 1996 Proceedings of the 24th annual ACM SIGUCCS conference on User 

services SIGUCCS 96 
Publisher: ACM Press 

Full text available: ^ pdf(5.73 MB ) Additional Information: full citation , references , index terms 
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8 Digest of proceedings seventh IEEE workshop on hot topics in operating systems 

^ March 29-30 1 9 99. Rio Rico , AZ 
^ M. Satyanarayanari 

October 1999 ACM SIGOPS Operating Systems Review, volume 33 issue 4 

Publisher: ACM Press 

Full text available: ^[pdf (1.67 MB) Additional Information: full citation , abstract , index terms 

The Seventh IEEE Workshop on Hot Topics in Operating Systems was held on March 29- 
30 1999 at the Rio Rico Resort & Country Club, south of Tucson, Arizona. The 
General Chair, Peter Druschel, and the Local Arrangements Chair, John Hartman, had 
gone to considerable effort to make the operation of the workshop smooth and pleasant 
for the participants. The secluded desert locale, the effect of brilliant sunshine and blue 
skies on winter-jaded northerners, and the enthusiasm and energy of the ... 

9 Trustworthy 100-year dig ital ob j ects: durable encodin g for when it's too late to ask 
H. M. Gladney, R. A. Lorie 

July 2005 ACM Transactions on Information Systems (TOIS), volume 23 issue 3 
Publisher: ACM Press 

Full text available* 151 pdfd 04 MB) Additional Information: full citation , abstract , references , citings , index 
. [2d terms 

How can an author store digital information so that it will be reliably intelligible, even 
years later when he or she is no longer available to answer questions? Methods that might 
work are not good enough; what is preserved today should be reliably intelligible 
whenever someone wants it. Prior proposals fail because they generally confound saved 
data with irrelevant details of today's information technology— details that are difficult to 
define, extract, and save completely and accurate ... 

Keywords: Long-term digital preservation, encoding 



10 Garbage collectin g the Internet: a survey of distributed g arba g e collection 
Saleh E. Abdullahi, Graem A. Ringwood 

September 1998 ACM Computing Surveys (CSUR), volume 30 issue 3 
Publisher: ACM Press 

Full text available* fl!l pdf(337 65 KB) Add'*' 01 " 131 Information: ful l citation , abstract, references , citings, i ndex 
. • terms , review 

Internet programming languages such as Java present new challenges to garbage- 
collection design. The spectrum of garbage-collection schema for linked structures 
distributed over a network are reviewed here. Distributed garbage collectors are classified 
first because they evolved from single-address-space collectors. This taxonomy is used as 
a framework to explore distribution issues: locality of action, communication overhead 
and indeterministic communication latency. 

Keywords: automatic storage reclamation, distributed, distributed file systems, 
distributed memories, distributed object-oriented management, memory management, 
network communication, object-oriented databases, reference counting 



11 HOPT: A myopic version of the STOCHQPT automatic file migration polic y 
Frank Olken 

August 1983 Proceedings of the 1983 ACM SIGMETRICS conference on Measurement 
and modeling of computer systems SIGMETRICS '83 

Publisher: ACM Press 

Full text available- fiD odf(382 82 KB) Additional Information: full citation , abstract , references , citin gs, index 

: terms 

The STOCHOPT automatic file migration policy (proposed by A. J. Smith) minimizes the 
expected retention and recall costs of an abitrarily sized file. We consider the application 
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of the STOCHOPT policy to a file system in which the file inter-reference time (IRT) 
distributions are characterized by strictly monotonically decreasing hazard rates (SDHR) 
(also known as decreasing failure rates, DFR). We show that in this case the STOCHOPT 
policy can be simply stated in terms of a scaled hazard ra ... 

1 2 D ynamic file mi gration in distribute d com puter s ystems 
^ Bezalel Gavish, Olivia R. Liu Sheng 

>/ February 1990 Communications of the ACM, Volume 33 issue 2 
Publisher: ACM Press 

Full text available* H3 pdf d 53 MB) Additional Information: f ull ci tatio n, abstract, references, citings, index 
^ terms , review 

The importance of file migration is increasing because of its potential to improve the 
performance of distributed office, manufacturing and hospital information systems. To 
encourage research in the file migration problem, the authors summarize 
accomplishments of researchers of the problem, provide a detailed comparison of file 
migration and dynamic file allocation problems, and identify important areas of research 
to support the development of effective file migration policies. 

13 S pecial issue on persistent ob j ec t s ystems: Ortho g onally persistent ob j ect systems 
Malcolm Atkinson, Ronald Morrison 

July 1995 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 4 Issue 3 
Publisher: Springer-Verlag New York, Inc. 

Full text available: pdf(5.02 MB) Additional Information: full citation , abstract , references , citing s 

Persistent Application Systems (PASs) are of increasing social and economic importance. 
They have the potential to be long-lived, concurrently accessed, and consist of large 
bodies of data and programs. Typical examples of PASs are CAD/CAM systems, office 
automation, CASE tools, software engineering environments, and patient-care support 
systems in hospitals. Orthogonally persistent object systems are intended to provide 
improved support for the design, construction, maintenance, and operation o ... 

Keywords: database programming languages, orthogonal persistence, persistent 
application systems, persistent programming languages 



14 Smalltalk-80: the lan guag e and its implementation 
Adele Goldberg, David Robson 

January 1983 Book 

Publisher: Addison-Wesley Longman Publishing Co., Inc. 

Full text available: ^ pdf(33. 56 MB) Additional Information: full citation, abstract, cited by. index terms, review 
From the Preface (See Front Matter for full Preface) 

Advances in the design and production of computer hardware have brought many more 
people into direct contact with computers. Similar advances in the design and production 
of computer software are required in order that this increased contact be as rewarding as 
possible. The Smalltalk-80 system is a result of a decade of research into creating 
computer software that is appropriate for producing highly functional and interactive ... 

15 Some assembl y r equired: bui ldi n g a digital g overnment fo r th e 2 1^ century 
Sharon S. Dawes, Peter A. Bloniarz, Kristine L. Kelly, Patricia D. Fletcher 

May 2002 Proceedings of the 2002 annual national conference on Digital 

government research dg.o '02 
Publisher: Digital Government Research Center 

Full text available: ^ pdf(889.22 KB) Additional Information: full citation , references 
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16 Some assembly required: buildin g a di g ital g overnment for the 21 ^ century 
Sharon S. Dawes, Peter A. Bloniarz, Kristine L. Kelly, Patricia D. Fletcher 

May 2000 Proceedings of the 2000 annual national conference on Digital 

government research dg.o 'OO 
Publisher: Digital Government Research Center 

Full text available: g pdf(889.21 KB) Additional Information: full citation , abstract , references , citing s 

This material is based upon work supported in part by the National Science Foundation 
under Grant No. 99-181. Any opinions, findings, and conclusions or recommendations 
expressed in this material are those of the authors and do not necessarily reflect the 
views of the National Science Foundation. 

17 O peratin g s ystems in a chan ging world 
Maurice Wilkes 

April 1994 ACM SIGOPS Operating Systems Review, Volume 28 issue 2 
Publisher: ACM Press 

Full text available: ^] pdf(71 1 .06 KB) Additional Information: full citation, index terms 




18 Secure file system versionin g at the block level 
vsg^ Jake Wires, Michael J. Feeley 

v March 2007 ACM SIGOPS Operating Systems Review , Proceedings of the 2007 
conference on EuroSys EuroSys '07, volume 41 issue 3 
Publisher: ACM Press 

Full text available: ^ pdf( 406.68 KB) Additional Information: full citation , abstract , references 

In typical file systems, valuable data is vulnerable to being accidentally or maliciously 
deleted or overwritten. Versioning file systems protect data from accidents by 
transparently retaining old versions, but do less well in protecting data from malicious 
attack. These systems remain vulnerable to attackers who gain unauthorized access to 
prune old file versions, who bypass the file system to directly manipulate storage, or who 
exploit bugs in any part of the operating system. 
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19 Web engineering: Just-in-time recovery of missing web pages 
Terry L. Harrison, Michael L. Nelson 

August 2006 Proceedings of the seventeenth conference on Hypertext and 
hypermedia HYPERTEXT 06 

Publisher: ACM Press 



Full text available: « gpdf(911.47 KB) 



Additional Information: full c i ta tion, abstract, references , citin gs, index 
terms 



We present Opal, a light-weight framework for interactively locating missing web pages 
(http status code 404). Opal is an example of "in vivo" preservation: harnessing the 
collective behavior of web archives, commercial search engines, and research projects for 
the purpose of preservation. Opal servers learn from their experiences and are able to 
share their knowledge with other Opal servers by mutual harvesting using the Open 
Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). Using ... 

Keywords: 404 web pages, apache web server, digital preservation 



20 Conserving network proce s sor po we r co nsumption by ex ploit ing traffi c var iability 
Yan Luo, Jia Yu, Jun Yang, Laxmi N. Bhuyan 

March 2007 ACM Transactions on Architecture and Code Optimization (TACO), volume 4 

Issue 1 
Publisher: ACM Press 

Full text available:^ pdf(595. 99 KB) Additional Information: full citation, abstract, references, index terms 
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Network processors (NPs) have emerged as successful platforms for providing both high 
performance and flexibility in building powerful routers. Typical NPs incorporate 
multiprocessing and multithreading to achieve maximum parallel processing capabilities. 
We observed that under low incoming traffic rates, processing elements (PEs) in an NP 
are idle for most of the time but still consume dynamic power. This paper develops a low- 
power technique to reduce the activities of PEs in accordance with ... 

Keywords: Network processor, clock gating, low power, scheduling 
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1 The hi gh performance storage system 
R. A. Coyne, H. Hulen, R. Watson 

December 1993 Proceedings of the 1993 ACM/IEEE conference on Supercomputing 
Supercomputing '93 

Publisher: ACM Press 

Full text available: ^ p df (1.05 MB) Additional Information: full citation , references, citings, index terms 



2 Integrating Metadata Tools with the Data Services Archive to Provide Web-based 
Management of Large-Scal e Sci enti f i c Si mula tion D ata 
Victor P. Holmes, Wilbur R. Johnson, David J. Miller 

April 2004 Proceedings of the 37th annual symposium on Simulation ANSS '04 
Publisher: IEEE Computer Society 

Full text available: ^ pdf(226.75 KB) Additional Information: full citation , abstract, index terms 

A Metadata tools system and a Data Services systemare undergoing development and 
integration, at SandiaNational Laboratories, to provide web-based access tohigh- 
performance computing clusters and its associatedsimulation data. These clusters host a 
set of scalable post-processing applications for very large data manipulation, and the 
examination of results generated by high-fidelitysimulations in support of the design to 
analysis process forensuring safety and reliability of the nation's nuclear ... 



Buildin g and su pp ortin g a m as sive data infrastructure for the masses 
Anurag Shankar, Gustav Meglicki, Jeff Russ, Haichuan Yang, E. Chris Garrison 
November 2002 Proceedings of the 30th annual ACM SIGUCCS conference on User 

services SIGUCCS '02 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings, index 
te rms 



Full text available: ^ pdf( 526.33 KB) 



On a typical university campus, the words "massive data storage" (MDS) usually bring to 
mind technology high-end, high performance computing (HPC) users might use. This is 
because academic supercomputer sites have traditionally provided a tightly interwoven 
HPC and high performance, MDS fabric to their users for decades. However, a new 
paradigm in data storage is now emerging where large, central, hierarchical. storage 
management (HSM) services may play an increasingly important role in the non-H ... 



Keywords: DCE, DFS, HPSS, HSM, distributed storage, hierarchical storage 
management, massive data storage, support 
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4 Memory: Couplin g prefix cachin g and collective downloads for remote dataset access 
Xiaosong Ma, Vincent W. Freeh, Tao Yang, Sudharshan S. Vazhkudai, Tyler A. Simon, 
Stephen L. Scott 

June 2006 Proceedings of the 20th annual international conference on 
Supercomputing ICS '06 

Publisher: ACM Press 

Full text available: « pdf ( 490.30 KB) Additional Information: full citation , abstract, references , cjtedby, index 
10 terms 

Scientific datasets are typically archived at mass storage systems or data centers close to 
supercomputers/instruments. End-users of these datasets, however, usually perform 
parts of their workflows at their local computers. In such cases, client-side caching can 
offer significant gains by reducing the cost of wide-area data movement. Scientific data 
caches, however, traditionally cache entire data-sets, which may not be necessary. In this 
paper, we propose a novel combination of prefix caching < ... 

5 Re sea rch d a ta storage available to researchers throughout the U.S. via the T eraGrid 
D. Scott McCaulay, Matt R. Link 

November 2006 Proceedings of the 34th annual ACM SIGUCCS conference on User 

services SIGUCCS 06 
Publisher: ACM Press 

Full text available: Q pdfd 39.44 KB ) Additional Information: full citation , abstract , references , index terms 

Many faculty members at small to mid-size colleges and universities do important, high 
quality research that requires significant storage. In many cases, such storage 
requirements are difficult to meet with local resources; even when local resources suffice, 
data integrity is best ensured by maintenance of a remote copy. Via the nationally-funded 
TeraGrid, Indiana University offers researchers at colleges and universities throughout the 
US the opportunity to easily store up to 1 TB of data withi ... 

Keywords: TeraGrid, grid computing, storage: large-scale 



Constructing collaborative desktop storage caches for large scientific datasets 
Sudharshan S. Vazhkudai, Xiaosong Ma, Vincent W. Freeh, Jonathan W. Strickland, Nandan 
Tammineedi, Tyler Simon, Stephen L. Scott 

August 2006 ACM Transactions on Storage (TOS), volume 2 issue 3 
Publisher: ACM Press 

Full text available: Q pdf(833.76 KB ) Additional Information: full citation , abs tr a ct, references , index term s 

High-end computing is suffering a data deluge from experiments, simulations, and 
apparatus that creates overwhelming application dataset sizes. This has led to the 
proliferation of high-end mass storage systems, storage area clusters, and data centers. 
These storage facilities offer a large range of choices in terms of capacity and access rate, 
as well as strong data availability and consistency support. However, for most end-users, 
the "last mile" in their analysis pipeline o ... 

Keywords: Distributed storage, parallel I/O, scientific data management, serverless 
storage system, storage cache, storage networking, storage resoucce management, 
storage scavenging, striped storage 



FreeLoader: Scaven g ing Desktop Stora g e Resources for Scientific Data 
Sudharshan S. Vazhkudai, Xiaosong Ma, Vincent W. Freeh, Jonathan W. Strickland, Nandan 
Tammineedi, Stephen L. Scott 

November 2005 Proceedings of the 2005 ACM/IEEE conference on Supercomputing SC 
•05 

Publisher: IEEE Computer Society 

Full text available: pdf(410.23 KB) Additional Information: full citation , abstract , index terms 
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High-end computing is suffering a data deluge from experiments, simulations, and 
apparatus that creates overwhelming application dataset sizes. End-user workstations- 
despite more processing power than ever before-are ill-equipped to cope with such data 
demands due to insufficient secondary storage space and I/O rates. Meanwhile, a large 
portion of desktop storage is unused. We present the FreeLoader framework, which 
aggregates unused desktop storage space and I/O bandwidth into a shared cache/sc ... 

Keywords: Distributed storage, storage scavenging, storage cache, serverless storage 
system, scientific data management, parallel I/O, striped storage 



8 The SDSC s tora g e resource broker 

Chaitanya Baru, Reagan Moore, Arcot Rajasekar, Michael Wan 

November 1998 Proceedings of the 1998 conference of the Centre for Advanced 
Studies on Collaborative research CASCON '98 

Publisher: IBM Press 

Full text available- IB Ddfd 53 81 KB) Additional Information: full citation , abstract , references , citings, index 
. "gjjLa : terms 

This paper describes the architecture of the SDSC Storage Resource Broker (SRB). The 
SRB is middleware that provides applications a uniform API to access heterogeneous 
distributed storage resources including, filesystems, database systems, and archival 
storage systems. The SRB utilizes a metadata catalog service, MCAT, to provide a 
"collection"- oriented view of data. Thus, data items that belong to a single collection 
may, in fact, be stored on heterogeneous storage systems. The SRB infrastruct ... 

9 A novel a p plication development environment for large-scale scientific computations 
Jt^ X. Shen, W. Liao, A. Choudhary, G. Memik, M. Kandemir, S. More, G. Thiruvathukal, A. Singh 

May 2000 Proceedings of the 14th international conference on Supercomputing ICS 
00 

Publisher: ACM Press 

Full text available- fijl pdf(1 16MB) Additional Information: full citation , abstract , references , citings, index 
" terms 

Our results demonstrate that our novel application development environment provides 
both ease-of-use and high performance for large-scale, I/Ointensive scientific 
applications. 

10 On implementing MPHO portably a n d wi th high performance 
Ra J eev Thakur, William Gropp, Ewing Lusk 

V May 1999 Proceedings of the sixth workshop on I/O in parallel and distributed 
systems IOPADS '99 
Publisher: ACM Press 

Full text available: ^ pdf(887.89 KB) Additional Information: full citation , references , citing s, index terms 



11 A brief survey of current work on network attached peripherals (extend ed abstract) 
Rodney Van Meter 

January 1996 ACM SIGOPS Operating Systems Review, volume 30 issue l 
Publisher: ACM Press 

Full text available:^) pdf(7 25 .39 KB) Additional Information: full citation, abstract , citings, index terms 

Work on network-attached peripherals (NAPs) can be divided into essentially three areas - 
-- device interfaces and protocols, multimedia use and mass storage use. This paper is an 
extended abstract reviewing some of the current work and provides references and WWW 
pointers to many of the projects. The impact of this technological advance on operating 
systems is discussed. The primary purpose of this paper is to broaden understanding of 
the advantages and pitfalls of NAPs and encourage further res ... 
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12 Using high-speed WANs and network data caches to enable remote and distributed 
visualization 

Wes Bethel, Brian Tierney, Jason lee, Dan Gunter, Stephen Lau 

November 2000 Proceedings of the 2000 ACM/IEEE conference on Supercomputing 
(CDROM) Supercomputing '00 

Publisher: IEEE Computer Society 

Full text available: || pdf(3Q2.38 KB) Additional Information: full citation , abstract , references , citings, index 
fl P ubl i s h er Site 

Visapult is a prototype application and framework for remote visualization of large 
scientific datasets. We approach the technical challenges of tera-scale visualization with a 
unique architecture which employs high speed WANs and network data caches for data 
staging and transmission. This architecture allows for the use of available cache and 
compute resources at arbitrary locations on the network. High data throughput rates and 
network utilization are achieved by parallelizing I/O at each ... 

13 GASS: a data movement and access service for wide area computing sy stems 
Joseph Bester, Ian Foster, Carl Kesselman, Jean Tedesco, Steven Tuecke 
May 1999 Proceedings of the sixth workshop on I/O in parallel and distributed 

systems IOPADS '99 
Publisher: ACM Press 

Full text available: ^] pdf(950.99 KB ) Additional Information: full citation , references , citings, index terms 



14 Statemate: a workin g environment for the development of complex reactive systems Q 
D. Harel, H. Lachover, A. Naamad, A. Pnueli, M. Politi, R. Sherman, a. Shtul-Trauring 

April 1988 Proceedings of the 10th international conference on Software engineering 
ICSE '88 

Publisher: IEEE Computer Society Press 

Full text available* fiC) pdf(1 1 9 MB) Additional Information: full citation , abstract , references , citings, index 
"™" ' terms 

This paper provides a brief overview of the STATEMATE system, constructed over the past 
three years by i-Logix Inc., and Ad Cad Ltd. STATEMATE is a graphical working 
environment, intended for the specification, analysis, design and documentation of large 
and complex reactive systems, such as real-time embedded systems, control and 
communication systems, and interactive software. It enables a user to prepare, analyze 
and debug diagrammatic, yet precise, descriptions of the system under devel ... 

15 File server scalin g with network-attached secure disks Q 
Garth A. Gibson, David F. Nagle, Khalil Amiri, Fay W. Chang, Eugene M. Feinberg, Howard 
Gobioff, Chen Lee, Berend Ozceri, Erik Riedel, David Rochberg, Jim Zelenka 

June 1997 ACM SIGMETRICS Performance Evaluation Review , Proceedings of the 
1997 ACM SIGMETRICS international conference on Measurement and 
modeling of computer systems SIGMETRICS '97, volume 25 issue l 

Publisher: ACM Press 

Full text available* fiO pdf(1. 77 MB) Additional Information: full citation , abstract , referenc es, citings, index 

terms 

By providing direct data transfer between storage and client, network-attached storage 
devices have the potential to improve scalability for existing distributed file systems (by 
removing the server as a bottleneck) and bandwidth for new parallel and distributed file 
systems (through network striping and more efficient data paths). Together, these 
advantages influence a large enough fraction of the storage market to make commodity 
network-attached storage feasible. Realizing the technology's ful ... 

16 File and storage systems: A hetero geneous storage g rid enabled b y g rid service Q 
Yuhui Deng, Frank Wang 

January 2007 ACM SIGOPS Operating Systems Review, volume 41 issue l 
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Publisher: ACM Press 

Full text available: ^|pdf(2 02.18 KB ) Additional Information: fu ll citatio n, abstract, references, index terms 

Due to the explosive increase of data, storage Grid is a new model for deploying and 
managing storage resources distributed across multiple systems and networks, making 
efficient use of available storage capacity. Building a storage Grid demands corresponding 
protocols and standards to provide interoperability among the large number of 
heterogeneous storage systems. Service is becoming a basic application pattern of Grid 
because the service offers a standard means of interoperating between diffe ... 

Keywords: architecture, grid service, heterogeneous storage system, interoperability, 
storage grid, web service 



17 Using Hilbert curve in image storing and retrieving 
Zhexuan Song, Nick Roussopoulos 

November 2000 Proceedings of the 2000 ACM workshops on Multimedia MULTIMEDIA 
00 

Publisher: ACM Press 

Full text available: ^pdf( 355.12 KB ) Additional Information: full ci ta tion, abstr act, references, index terms 

In this paper, we present a method to accelerate the speed of retrieving subset of 
uncompressed images in a database without using extra disk space. First we change the 
storing method: pixels of an image are saved in Hilbert order instead of Row-wise order in 
traditional method. After studying the property of Hilbert curve, we give a new algorithm 
which greatly reduces the data segment numbers on the disk. Although we have to 
retrieve more data than necessary, because the speed of sequential ... 

18 Workflow a ppli cat ion s a nd m odels: Workf low automation for processing plasma 
fusion simulation data 

Norbert Podhorszki, Bertram Ludaescher, Scott A. Klasky 

June 2007 Proceedings of the 2nd workshop on Workflows in support of large-scale 

science WORKS 07 
Publisher: ACM Press 

Full text available: Q pdf(511.27 KB) Additional Information: full citation , abstract , references , index terms 

The Center for Plasma Edge Simulation project aims to automate the tedious tasks of 
monitoring the simulation, archiving and post-processing the output. This paper describes 
the tasks and requirements, the several components developed within the Kepler 
workflow system to provide the required functionality and the automated workflow 
solution. Besides functionality, the focus is on the robust execution of the workflow. A 
user-level checkpoint/restart model has been developed that allows the wor ... 

Keywords: Kepler, SSH, checkpointing, plasma fusion, scientific workflow 



1 9 Inte g ratin g parallel f ile I/O and database sup port for hi gh- performance scientific data Q 
mana g ement 

Jaechun No, Rajeev Thakur, Alok Choudhary 

November 2000 Proceedings of the 2000 ACM/IEEE conference on Supercomputing 
(CDROM) Supercomputing '00 

Publisher: IEEE Computer Society 

Full text available: ra&df(174J4 J<B) Additional Information: full citation , abstract , references , citings , index 
W Publisher Site 

Many scientific applications have large I/O requirements, in terms of both the size of data 
and the number of files or data sets. Management, storage, efficient access, and analysis 
of these data present an extremely challenging task. Traditionally, two different solutions 
are used for this problem: file I/O or databases. File I/O can provide high performance 
but is tedious to use with large numbers of files and large and complex data sets. 
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20 S pecial section on advanced XML data processin g : Preservation of di g ital data with Q 

self-validatin g , self-instantiatin g knowled ge-base d archives 
^ Bertram Ludascher, Richard Marciano, Reagan Moore 

September 2001 ACM SIGMOD Record, volume 30 issue 3 

Publisher: ACM Press 

Full text available: g pdf(881,20 KB) Additional Information: full citation , abstract , references , index terms 

Digital archives are dedicated to the long-term preservation of electronic information and 
have the mandate to enable sustained access despite rapid technology changes. 
Persistent archives are confronted with heterogeneous data formats, helper applications, 
and platforms being used over the lifetime of the archive. This is not unlike the 
interoperability challenges, for which mediators are devised. To prevent technological 
obsolescence over time and across platforms, a migration approach for per ... 
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1 Performance analysis of coherency control policies throu g h lock retention 
Asit Dan, Philip S. Yu 

June 1992 ACM SIGMOD Record , Proceedings of the 1992 ACM SIGMOD international 

conference on Management of data SIGMOD '92, volume 21 issue 2 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citin gs, index 
terms 



Full text available:^ pdf ( 1.18 MB) 



Buffer coherency control can be achieved through retaining a lock (shared, exclusive, 
etc.) on each page in the buffer, even after the requesting transaction has committed. 
Depending upon the lock mode held for retention and the compatibility of lock modes 
specified, different retention policies can be devised. In addition to tracking the validity of 
the buffered data granules, additional capabilities can be provided such as deferred writes 
to support no-force policy on commit, (node) locat ... 

2 A formal semantics for P3P 
Ting Yu, Ninghui Li, Annie I. Anton 

October 2004 Proceedings of the 2004 workshop on Secure web service SWS '04 
Publisher: ACM Press 

Full text available: ^ pdf(268.20 KB) Additional Information: full citation , abstract , references 

The Platform for Privacy Preferences (P3P), developed by the W3C, provides an XML- 
based language for websites to encode their data-collection and data-use practices in a 
machine-readable form. To fully deploy P3P in enterprise information systems and over 
the Web, a well-defined semantics for P3P policies is a must, which is lacking in the 
current P3P framework. Without a formal semantics, a P3P policy may be semantically 
inconsistent and may be interpreted and represented differently by differe ... 



Automated analysis of P3P-enabled Web sites 
Simon Byers, Lorrie Faith Cranor, David Kormann 

September 2003 Proceedings of the 5th international conference on Electronic 
commerce ICEC '03 

Publisher: ACM Press 

Additional Information: full citation, abstract, references, citings, index 
terms , review 



Full text available: g pdfd 93.46 KB) 



The Platform for Privacy Preferences (P3P) provides a standard computer-readable format 
for privacy policies and a protocol that enables web browsers to read and process these 
policies automatically. We developed software to query a set of web sites for P3P policies, 
check the validity of each policy, and analyze the information practices it describes. We 
used this software to analyze 588 P3P-enabled web sites found by checking for P3P 
policies on 5,856 web sites on 17 July 2003. The sites we ch ... 



Page 1 



http://portal.acm.org/resu^ 



Keywords: P3P, platform for privacy preferences, privacy, privacy policy 

4 The platform for privac y preference as a social protocol: An examination within the 

U.S. policy context 
Harry Hochheiser 

November 2002 ACM Transactions on Internet Technology (TOIT), volume 2 issue 4 
Publisher: ACM Press 

Full text available- *Pl odf(241 03 KB) Add ' tionaI Information: full citation , abstract , references , citings, index 
u v 1 T2d 15 : — — terms 

As a "social protocol" aimed at providing a technological means to address concerns over 
Internet privacy, the Platform for Privacy Preferences (P3P) has been controversial since 
its announcement in 1997. In the U.S., critics have decried P3P as an industry attempt to 
avoid meaningful privacy legislation, while developers have portrayed the proposal as a 
tool for helping users make informed decisions about the impact of their Web surfing 
choices. This dispute touches upon the privacy model under ... 

Keywords: P3P, Privacy, social protocols 



5 Secure file system versionin g at the block level 
Jake Wires, Michael J. Feeley 

March 2007 ACM SIGOPS Operating Systems Review , Proceedings of the 2007 

conference on EuroSys EuroSys '07, volume 41 issue 3 
Publisher: ACM Press 

Full text available: ^ pdf ( 406.68 KB ) Additional Information: full citation , abstract , references 

In typical file systems, valuable data is vulnerable to being accidentally or maliciously 
deleted or overwritten. Versioning file systems protect data from accidents by 
transparently retaining old versions, but do less well in protecting data from malicious 
attack. These systems remain vulnerable to attackers who gain unauthorized access to 
prune old file versions, who bypass the file system to directly manipulate storage, or who 
exploit bugs in any part of the operating system. 

This p ... 

E-P3P privacy policies and privacy authorization 
Paul Ashley, Satoshi Hada, Gunter Karjoth, Matthias Schunter 

November 2002 Proceedings of the 2002 ACM workshop on Privacy in the Electronic 

Society WPES '02 
Publisher: ACM Press 

Full text available '1S Ddf(146 35 KB) Additional Information: full citation , abstract , references , citings, index 
' ^ : terms 

Enterprises collect large amounts of personal data from their customers. To ease privacy 
concerns, enterprises publish privacy statements that outline how data is used and 
shared. The Platform for Enterprise Privacy Practices (E-P3P) defines a fine-grained 
privacy policy model. A Chief Privacy Officer can use E-P3P to formalize the desired 
enterprise-internal handling of collected data. A particular data user is then allowed to use 
certain collected data for a given purpose if and only if the E- ... 

Keywords: E-P3P, privacy manager, privacy policies 



7 S ystems: Multi-fidelity stora ge 

Padmanabhan Pillai, Van Ke, Jason Campbell 

October 2004 Proceedings of the ACM 2nd international workshop on Video 
surveillance & sensor networks VSSN '04 
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Imaging sensors are inherently high bandwidth devices, and applications which store 
image data often encounter disk or memory limits. Commonly, upon reaching such a 
limit, storage systems will cease sampling or overwrite existing data in an oldest-first 
fashion. For most applications, neither approach is optimal. We introduce a flexible, 
policy-based, on-line algorithm for maximizing the application-specific value of data 
' retained on a read/write/erase storage medium such as a hard disk or fl ... 

Keywords: variable fidelity, video surveillance, weighted fair-share storage 



8 Decidin g when to for g et in the Elephant file system 

Douglas S. Santry, Michael J. Feeley, Norman C. Hutchinson, Alistair C. Veitch, Ross W. 
Carton, Jacob Ofir 

December 1999 ACM SIGOPS Operating Systems Review , Proceedings of the 

seventeenth ACM symposium on Operating systems principles SOSP 

'99, Volume 33 Issue 5 
Publisher: ACM Press 

Full text available* fP) df(1 61 MB) Additional Information: full citation , abstract , references , citin gs, index 
' ™ " terms 

Modern file systems associate the deletion of a file with the immediate release of storage, 
and file writes with the irrevocable change of file contents. We argue that this behavior is 
a relic of the past, when disk storage was a scarce resource. Today, large cheap disks 
make it possible for the file system to protect valuable data from accidental delete or 
overwrite.This paper describes the design, implementation, and performance of the 
Elephant file system, which automatically retains all impo ... 



9 Data protection and data sharing in telematics 

Sastry Duri, Jeffrey Elliott, Marco Gruteser, Xuan Liu, Paul Moskowitz, Ronald Perez, 
Moninder Singh, Jung-Mu Tang 

December 2004 Mobile Networks and Applications, volume 9 issue 6 
Publisher: Kluwer Academic Publishers 

Full text available: pdf( 303.67 KB ) Additional Information: full citat ion, abstract, references , index terms 

Automotive telematics may be defined as the information-intensive applications enabled 
for vehicles by a combination of telecommunications and computing technology. 
Telematics by its nature requires the capture, storage, and exchange of sensor data to 
obtain remote services. Such data likely include personal, sensitive information, which 
require proper handling to protect the driver's privacy. Some existing approaches focus on 
protecting privacy through anonymous interactions or by stopping i ... 

Keywords: automotive telematics, data protection architecture, privacy policies 
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Publisher: ACM Press 

Full text available: pdf (172.55 KB ) Additional Information: f ull c itation, abstract , index t erms 

As we move our social institutions from paper and ink based operations to the electronic 
medium, we invisibly create a type of surveillance society, a panopticon society. It is not 
the traditional surveillance society in which government officials follow citizens around 
because they are concerned about threats to the political order. Instead it is piecemeal 
surveillance by public and private organizations. Piecemeal though it is, It creates the 
potential for the old kind of surveillan ... 
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11 User interfaces for privacy agents 

Lorrie Faith Cranor, Praveen Guduru, Manjula Arjula 

June 2006 ACM Transactions on Computer-Human Interaction (TOCHI), volume 13 issue 
2 

Publisher: ACM Press 

Full text available: ^| pdf(1 .82 MB) Additional Information: full citation, abstract, references, index terms 

Most people do not often read privacy policies because they tend to be long and difficult 
• to understand. The Platform for Privacy Preferences (P3P) addresses this problem by 
providing a standard machine-readable format for website privacy policies. P3P user 
agents can fetch P3P privacy policies automatically, compare them with a user's privacy 
preferences, and alert and advise the user. Developing user interfaces for P3P user agents 
is challenging for several reasons: privacy policies are comple ... 

Keywords: P3P, preferences, privacy, privacy enhancing technology, privacy policy, user 
agent 
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Mark S. Ackerman 

November 2004 Personal and Ubiquitous Computing, volume 8 issue 6 
Publisher: Springer-Verlag 

Full text available:^ pdf(221 .64 KB ) Additional Information: full citation, abstract, citin gs, ind ex terms 

In pervasive environments, privacy is likely to be a major issue for users, and users will 
want to be notified of potential data capture. To provide notice to users, this paper argues 
for what it calls labeling protocols, technical mechanisms through which users can be 
informed of data requests and their consequences. Recent experiences with the Platform 
for Privacy Preferences Project (P3P), an attempt to provide privacy mechanisms for the 
Web, suggest important lessons for the design of a n ... 

Keywords: Labeling protocols, P3P, Pervasive environments, Platform for privacy 
preferences, Privacy, Ubiquitous computing 
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15 Use of a P3P user agent by early adopters 
Lorrie Faith Cranor, Manjula Arjula, Praveen Guduru 

November 2002 Proceedings of the 2002 ACM workshop on Privacy in the Electronic 
Society WPES 02 

Publisher: ACM Press 

Full text available- fi3 odf(450 21 KB) Addjtional Information: fu ll c it at ion, abstr a ct , re fe renc es , citings, index 

: terms , review 

The Platform for Privacy Preferences (P3P), developed by the World Wide Web Consortium 
(W3C), provides a standard computer-readable format for privacy policies and a protocol 
that enables web browsers to read and process privacy policies automatically. P3P enables 
machine-readable privacy policies that can be retrieved automatically by web browsers 
and other user agent tools that can display symbols, prompt users, or take other 
appropriate actions. We developed the AT&T Privacy Bird as a P3P use ... 

Keywords: P3P, privacy, survey, user agent, user study 



16 Protocols: An XPath-based prefe r e n ce la ng ua ge for P3P 
^ Rakesh Agrawal, Jerry Kiernan, Ramakrishnan Srikant, Yirong Xu 
>^ May 2003 Proceedings of the 12th international conference on World Wide Web 
WWW '03 
Publisher: ACM Press 

Full text available- "H pdf(1 07 76 KB) Addltional Information: full citation, abstract, references, citings, index 
. [£j terms 

The Platform for Privacy Preferences (P3P) is the most significant effort currently 
underway to enable web users to gain control over their private information. The 
designers of P3P simultaneously designed a preference language called APPEL to allow 
users to express their privacy preferences, thus enabling automatic matching of privacy 
preferences against P3P policies. Unfortunately subtle interactions between P3P and 
APPEL result in serious problems when using APPEL: Users can only directly spe ... 

Keywords: APPEL, P3P, XPath, XPref, hippocratic databases, preference, privacy-aware 
data management 



1 7 How ordinary internet users can have a chance to influ ence privacy policies 
^ John Soren Pettersson, Simone Fischer-Hubner, Marco Casassa Mont, Siani Pearson 

October 2006 Proceedings of the 4th Nordic conference on Human-computer 
interaction: changing roles NordiCHI '06 

Publisher: ACM Press 

Full text available: Q pdf ( 534.06 KB ) Additional Information: full citation , abstract , references , index terms 

By 'Obligation Management' we refer to the definition, automated enforcement, and 
monitoring of privacy obligation policies. Privacy policies are nowadays found on most 
organisations' web pages, especially when data is directly collected from the 
user/customer. The paper demonstrates how users can influence rather than merely 
accept a privacy policy, and further relates this to the larger framework within which 
policy compliance should be discussed. Four problem areas are identified (from design 

Keywords: obligation management, privacy policy, user interfaces 
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November 2006 Proceedings of the second ACM workshop on Digital identity 
management DIM '06 

Publisher: ACM Press 

Full text available: ^| pdf (655.02 KB ) Additional Information: full citation , abstract , references , index terms 

The goal of this paper is to study secure and usable methods for providing user input to a 
website. Three principles define security for us: certification, awareness, and privacy. Four 
principles define usability: contextual awareness, semantic awareness, prodigious use of 
screen space, and the availability of recommended choices. We first describe how current 
approaches to the solicitation of user input on the web fail on both fronts: they either can 
not handle certified data, do not resp ... 

Keywords: user interface designs 



19 B2B e-commerce and enterprise integration: Towards end-to-end privacy control in 
^ the outsourcing of marketin g activities: a web service inte gra tion soluti on 
^ Patrick C. K. Hung, Dickson K. W. Chiu, W. W. Fung, William K. Cheung, Raymond Wong, 
Samuel P. M. Choi, Eleanna Kafeza, James Kwok, Jousha C. C. Pun, Vivying S. Y. Cheng 
August 2005 Proceedings of the 7th international conference on Electronic commerce 

ICEC05 
Publisher: ACM Press 

Full text available: ^] p df(4 38.26 K B) Additional Information: full citation , abstract , refer en c es , index terms 

With the recent adoption of marketing activities outsourcing, there have been increasing 
demands and concerns for privacy control. The traditional approach of a bulk transmission 
of the customers' information to a marketing company cannot meet such demands, 
especially in the finance and healthcare businesses. Therefore, we propose a layered 
architecture and a development methodology for end-to-end privacy control over the 
export of each individual customer's records through a Web services platfo ... 

Keywords: EPAL, OWL, SOAP, UDDI, WSDL, Web service integration, need-to-know 
principle, privacy policies 



20 Enterprise privacy promises and enforcement 
Adam Barth, John C. Mitchell 

January 2005 Proceedings of the 2005 workshop on Issues in the theory of security 
WITS '05 

Publisher: ACM Press 

Full text available: ^ pdf( 261 .05 KB ) Additional Information: f ul l citation, abstract, references , index terms 

Several formal languages have been proposed to encode privacy policies, ranging from 
the Platform for Privacy Preferences (P3P), intended for communicating privacy policies to 
consumers over the web, to the Enterprise Privacy Authorization Language (EPAL), 
intended to enable policy enforcement within an enterprise. However, current technology 
does not allow an enterprise to determine whether its detailed, internal enforcement 
policy meets its published privacy promises. We present a data-centric ... 

Keywords: EPAL, P3P, modal logic, policy summary, privacy policy 
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